Overview

Dataset Statistics

Number of Variables 21
Number of Rows 4119
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 3.0 MB
Average Row Size in Memory 770.5 B
Variable Types
  • Numerical: 9
  • Categorical: 12

Dataset Insights

duration is skewed
campaign is skewed
pdays is skewed
emp.var.rate is skewed
cons.price.idx is skewed
cons.conf.idx is skewed
euribor3m is skewed
nr.employed is skewed
month has constant length 3
day_of_week has constant length 3
previous has constant length 1
emp.var.rate has 1735 (42.12%) negatives
cons.conf.idx has 4119 (100.0%) negatives

Variables


age

numerical

Approximate Distinct Count 67
Approximate Unique (%) 1.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 65904
Mean 40.1136
Minimum 18
Maximum 88
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • age is skewed right (γ1 = 0.7154)

Quantile Statistics

Minimum 18
5-th Percentile 26
Q1 32
Median 38
Q3 47
95-th Percentile 58
Maximum 88
Range 70
IQR 15

Descriptive Statistics

Mean 40.1136
Standard Deviation 10.3134
Variance 106.3654
Sum 165228
Skewness 0.7154
Kurtosis 0.4361
Coefficient of Variation 0.2571
  • age is not normally distributed (p-value 0.000978208272862551)
  • age has 39 outliers
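
Several of the figures above are derived from others in the same section, so they can be cross-checked by hand. The following is a minimal sketch using only the values printed for age. The outlier fences assume the common 1.5 × IQR rule; the report does not state which rule produced the "39 outliers" count, so treat that part as an assumption.

```python
# Values copied from the age section of this report.
minimum, q1, q3, maximum = 18, 32, 47, 88
mean, std = 40.1136, 10.3134

value_range = maximum - minimum   # reported Range: 70
iqr = q3 - q1                     # reported IQR: 15
variance = std ** 2               # reported Variance: 106.3654 (small rounding gap expected)
cv = std / mean                   # reported Coefficient of Variation: 0.2571

# ASSUMPTION: Tukey-style fences at 1.5 * IQR; the report does not name its outlier rule.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

print(value_range, iqr, round(variance, 2), round(cv, 4))
print(lower_fence, upper_fence)
```

The same relations hold for every numerical variable in the report. A negative mean, as for cons.conf.idx, gives a negative coefficient of variation.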

job

categorical

Approximate Distinct Count 12
Approximate Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Memory Size 304777

Length

Mean 8.993
Standard Deviation 2.1585
Median 10
Minimum 6
Maximum 13

Sample

1st row blue-collar
2nd row services
3rd row services
4th row services
5th row admin.

Letter

Count 34987
Lowercase Letter 34987
Space Separator 0
Uppercase Letter 0
Dash Punctuation 1043
Decimal Number 0

marital

categorical

Approximate Distinct Count 4
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 295861
  • The largest value (married) is over 2.18 times larger than the second largest value (single)

Length

Mean 6.8284
Standard Deviation 0.599
Median 7
Minimum 6
Maximum 8

Sample

1st row married
2nd row single
3rd row married
4th row married
5th row married

Letter

Count 28126
Lowercase Letter 28126
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (married, single) take over 50.0%

education

categorical

Approximate Distinct Count 8
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 320546

Length

Mean 12.8213
Standard Deviation 4.403
Median 11
Minimum 7
Maximum 19

Sample

1st row basic.9y
2nd row high.school
3rd row high.school
4th row basic.9y
5th row university.degree

Letter

Count 47629
Lowercase Letter 47629
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 1231
  • The top 2 categories (university.degree, high.school) take over 50.0%

default

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 279989
  • The largest value (no) is over 4.13 times larger than the second largest value (unknown)

Length

Mean 2.975
Standard Deviation 1.981
Median 2
Minimum 2
Maximum 7

Sample

1st row no
2nd row no
3rd row no
4th row no
5th row no

Letter

Count 12254
Lowercase Letter 12254
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (no, unknown) take over 50.0%
  • The second largest value (unknown) is over 803.0 times larger than the third largest value (yes)
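
The ratio insights for default compare consecutive categories after sorting by count: no against unknown, then unknown against yes. A minimal sketch of that arithmetic follows. The per-category counts are not printed in this report; the ones below are illustrative values chosen to be consistent with the 4119 rows and the two reported ratios, and `imbalance_summary` is a hypothetical helper name.

```python
def imbalance_summary(counts):
    """Sort categories by count; return the order, the top-2 share, and consecutive ratios."""
    ordered = sorted(counts.items(), key=lambda kv: kv[1], reverse=True)
    total = sum(counts.values())
    top_two_share = (ordered[0][1] + ordered[1][1]) / total
    # ratios[i] is the count at rank i divided by the count at rank i + 1
    # (this sketch assumes every category has a non-zero count).
    ratios = [ordered[i][1] / ordered[i + 1][1] for i in range(len(ordered) - 1)]
    return ordered, top_two_share, ratios

# ASSUMPTION: illustrative counts, not taken from the report.
counts = {"no": 3315, "unknown": 803, "yes": 1}
ordered, share, ratios = imbalance_summary(counts)

print([name for name, _ in ordered])   # categories from most to least frequent
print(round(ratios[0], 2), ratios[1])  # largest/second, second/third
print(share > 0.5)                     # the "top 2 categories take over 50%" check
```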

housing

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 278673

Length

Mean 2.6555
Standard Deviation 0.8578
Median 3
Minimum 2
Maximum 7

Sample

1st row yes
2nd row no
3rd row yes
4th row unknown
5th row yes

Letter

Count 10938
Lowercase Letter 10938
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (yes, no) take over 50.0%
  • The largest value (yes) is over 20.71 times larger than the third largest value (unknown)

loan

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 277163
  • The largest value (no) is over 5.04 times larger than the second largest value (yes)

Length

Mean 2.2889
Standard Deviation 0.8458
Median 2
Minimum 2
Maximum 7

Sample

1st row no
2nd row no
3rd row no
4th row unknown
5th row no

Letter

Count 9428
Lowercase Letter 9428
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (no, yes) take over 50.0%
  • The second largest value (yes) is over 6.33 times larger than the third largest value (unknown)

contact

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 302154
  • The largest value (cellular) is over 1.81 times larger than the second largest value (telephone)

Length

Mean 8.3562
Standard Deviation 0.4789
Median 8
Minimum 8
Maximum 9

Sample

1st row cellular
2nd row telephone
3rd row telephone
4th row telephone
5th row cellular

Letter

Count 34419
Lowercase Letter 34419
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (cellular, telephone) take over 50.0%

month

categorical

Approximate Distinct Count 10
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 280092
  • The largest value (may) is over 1.94 times larger than the second largest value (jul)

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row may
2nd row may
3rd row jun
4th row jun
5th row nov

Letter

Count 12357
Lowercase Letter 12357
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (may, jul) take over 50.0%
  • month has words of constant length

day_of_week

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 280092

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row fri
2nd row fri
3rd row wed
4th row fri
5th row mon

Letter

Count 12357
Lowercase Letter 12357
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • day_of_week has words of constant length

duration

numerical

Approximate Distinct Count 828
Approximate Unique (%) 20.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 65904
Mean 256.7881
Minimum 0
Maximum 3643
Zeros 1
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • duration is skewed right (γ1 = 3.2936)

Quantile Statistics

Minimum 0
5-th Percentile 35
Q1 103
Median 181
Q3 317
95-th Percentile 740.2
Maximum 3643
Range 3643
IQR 214

Descriptive Statistics

Mean 256.7881
Standard Deviation 254.7037
Variance 64873.9932
Sum 1.0577e+06
Skewness 3.2936
Kurtosis 20.7353
Coefficient of Variation 0.9919
  • duration is not normally distributed (p-value 3.217070694022297e-13)
  • duration has 291 outliers
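
The skewness (γ1) and kurtosis figures in this report are moment-based shape statistics. Because several variables show negative kurtosis, the reported value must be excess kurtosis (zero for a normal distribution). The sketch below computes both from population central moments on a small made-up sample. Whether the report applies a bias correction is not stated, so this is one plausible definition rather than the report's exact estimator.

```python
def central_moment(values, k):
    """k-th central moment, dividing by n (no bias correction)."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** k for v in values) / len(values)

def skewness(values):
    """Moment coefficient of skewness: m3 / m2 ** 1.5."""
    return central_moment(values, 3) / central_moment(values, 2) ** 1.5

def excess_kurtosis(values):
    """Excess kurtosis: m4 / m2 ** 2 - 3."""
    return central_moment(values, 4) / central_moment(values, 2) ** 2 - 3.0

# ASSUMPTION: toy samples for illustration only, not data from the report.
symmetric = [1, 2, 3, 4, 5]
long_right_tail = [1, 1, 1, 2, 10]

print(skewness(symmetric))         # zero up to floating-point error for a symmetric sample
print(skewness(long_right_tail))   # positive: the tail extends to the right
print(excess_kurtosis(symmetric))  # negative: flatter than a normal distribution
```

A positive γ1, as for duration and campaign, indicates a long right tail. A negative γ1, as for pdays, indicates a long left tail.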

campaign

numerical

Approximate Distinct Count 25
Approximate Unique (%) 0.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 65904
Mean 2.5373
Minimum 1
Maximum 35
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • campaign is skewed right (γ1 = 4.0017)

Quantile Statistics

Minimum 1
5-th Percentile 1
Q1 1
Median 2
Q3 3
95-th Percentile 7
Maximum 35
Range 34
IQR 2

Descriptive Statistics

Mean 2.5373
Standard Deviation 2.5682
Variance 6.5954
Sum 10451
Skewness 4.0017
Kurtosis 25.2524
Coefficient of Variation 1.0122
  • campaign is not normally distributed (p-value 1.2052748018093327e-18)
  • campaign has 235 outliers

pdays

numerical

Approximate Distinct Count 21
Approximate Unique (%) 0.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 65904
Mean 960.4222
Minimum 0
Maximum 999
Zeros 2
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • pdays is skewed left (γ1 = -4.7734)

Quantile Statistics

Minimum 0
5-th Percentile 999
Q1 999
Median 999
Q3 999
95-th Percentile 999
Maximum 999
Range 999
IQR 0

Descriptive Statistics

Mean 960.4222
Standard Deviation 191.9228
Variance 36834.3557
Sum 3.956e+06
Skewness -4.7734
Kurtosis 20.7858
Coefficient of Variation 0.1998
  • pdays is not normally distributed (p-value 4.612944247992616e-25)
  • pdays has 160 outliers

previous

categorical

Approximate Distinct Count 7
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 271854
  • The largest value (0) is over 7.42 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 4119
  • The top 2 categories (0, 1) take over 50.0%
  • previous has words of constant length

poutcome

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 310660
  • The largest value (nonexistent) is over 7.76 times larger than the second largest value (failure)

Length

Mean 10.4212
Standard Deviation 1.4073
Median 11
Minimum 7
Maximum 11

Sample

1st row nonexistent
2nd row nonexistent
3rd row nonexistent
4th row nonexistent
5th row nonexistent

Letter

Count 42925
Lowercase Letter 42925
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (nonexistent, failure) take over 50.0%

emp.var.rate

numerical

Approximate Distinct Count 10
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 65904
Mean 0.08497
Minimum -3.4
Maximum 1.4
Zeros 0
Zeros (%) 0.0%
Negatives 1735
Negatives (%) 42.1%
  • emp.var.rate is skewed left (γ1 = -0.7274)

Quantile Statistics

Minimum -3.4
5-th Percentile -2.9
Q1 -1.8
Median 1.1
Q3 1.4
95-th Percentile 1.4
Maximum 1.4
Range 4.8
IQR 3.2

Descriptive Statistics

Mean 0.08497
Standard Deviation 1.5631
Variance 2.4433
Sum 350
Skewness -0.7274
Kurtosis -1.042
Coefficient of Variation 18.3956
  • emp.var.rate is not normally distributed (p-value 2.4250757959874118e-17)

cons.price.idx

numerical

Approximate Distinct Count 26
Approximate Unique (%) 0.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 65904
Mean 93.5797
Minimum 92.201
Maximum 94.767
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • cons.price.idx is skewed left (γ1 = -0.2166)

Quantile Statistics

Minimum 92.201
5-th Percentile 92.713
Q1 93.075
Median 93.749
Q3 93.994
95-th Percentile 94.465
Maximum 94.767
Range 2.566
IQR 0.919

Descriptive Statistics

Mean 93.5797
Standard Deviation 0.5793
Variance 0.3356
Sum 385454.802
Skewness -0.2166
Kurtosis -0.8238
Coefficient of Variation 0.006191
  • cons.price.idx is not normally distributed (p-value 1.4105219185267356e-09)

cons.conf.idx

numerical

Approximate Distinct Count 26
Approximate Unique (%) 0.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 65904
Mean -40.4991
Minimum -50.8
Maximum -26.9
Zeros 0
Zeros (%) 0.0%
Negatives 4119
Negatives (%) 100.0%
  • cons.conf.idx is skewed right (γ1 = 0.2872)

Quantile Statistics

Minimum -50.8
5-th Percentile -47.1
Q1 -42.7
Median -41.8
Q3 -36.4
95-th Percentile -33.6
Maximum -26.9
Range 23.9
IQR 6.3

Descriptive Statistics

Mean -40.4991
Standard Deviation 4.5946
Variance 21.1101
Sum -166815.8
Skewness 0.2872
Kurtosis -0.3154
Coefficient of Variation -0.1134
  • cons.conf.idx is not normally distributed (p-value 5.017813685399287e-15)
  • cons.conf.idx has 43 outliers

euribor3m

numerical

Approximate Distinct Count 234
Approximate Unique (%) 5.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 65904
Mean 3.6214
Minimum 0.635
Maximum 5.045
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • euribor3m is skewed left (γ1 = -0.7148)

Quantile Statistics

Minimum 0.635
5-th Percentile 0.8084
Q1 1.334
Median 4.857
Q3 4.961
95-th Percentile 4.966
Maximum 5.045
Range 4.41
IQR 3.627

Descriptive Statistics

Mean 3.6214
Standard Deviation 1.7336
Variance 3.0053
Sum 14916.364
Skewness -0.7148
Kurtosis -1.3961
Coefficient of Variation 0.4787
  • euribor3m is not normally distributed (p-value 6.97463958498238e-18)

nr.employed

numerical

Approximate Distinct Count 11
Approximate Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 65904
Mean 5166.4817
Minimum 4963.6
Maximum 5228.1
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • nr.employed is skewed left (γ1 = -1.0755)

Quantile Statistics

Minimum 4963.6
5-th Percentile 5008.7
Q1 5099.1
Median 5191
Q3 5228.1
95-th Percentile 5228.1
Maximum 5228.1
Range 264.5
IQR 129

Descriptive Statistics

Mean 5166.4817
Standard Deviation 73.6679
Variance 5426.96
Sum 2.1281e+07
Skewness -1.0755
Kurtosis 0.06019
Coefficient of Variation 0.01426
  • nr.employed is not normally distributed (p-value 1.3464544117120773e-17)

y

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 276424
  • The largest value (no) is over 8.13 times larger than the second largest value (yes)

Length

Mean 2.1095
Standard Deviation 0.3123
Median 2
Minimum 2
Maximum 3

Sample

1st row no
2nd row no
3rd row no
4th row no
5th row no

Letter

Count 8689
Lowercase Letter 8689
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (no, yes) take over 50.0%

Interactions

Correlations

Missing Values